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Two gradin.g distributions — a lib_erLaJ._,asLd„a„aiQxe - 

ve — were compared' between two sections of an 

psychology course. The two classes were team taught and 
Diilariy as possible* Four unit exams and an optional 
ministered, with one section consistently receiving a 
grade distribution. No differences in exam performance 
ts of study behavior and attendance occurred * between 
nt evaluations of course and instructors were 

higher in the class*^receivii>g more liberal" grading. The 
of the results to previous findings are discussed with 
hcdology and implications. (Author) 
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* Liberal grading improves evaluations 
but not. performance 



Ross Vasta ~ and Robert F- Sarmiento 

State University of New York College^ at Brockpojt 



yS OE^AKTMCNT OF HEALTH- 
COUCAT^ON 4WELFABE 
>4AH0NALIMSTITUTC0F 
EOOCATION 

TMt PERSON 0« 0«CANi?AT»ON OR»0»N- 
It»NC »t POINTS Of v«ew OR OPINIONS 

stATco 00 NOT Necesv^RicY ^^epre- 

"SInT 0*^P«OAt NATIONAL INSTiTU-^E OP 

eouCATiON pos-"ONOR pot'CY 



0^ 



Paper presented at the meeting of the American Hducati'ohal Research 
Association > Sau Francisco, April, 1976. 




\ 



/ 



ITie use of norm-referenced or ''curved^* grade distributions is 
widespread in coljege courses-^pa^ticularly , large ones. Although t;hxs 

form of grading, ha? been found to be as effective as a criteraTon- , ^ 

-refcreTiwd-ar~iih:nH-ntcr--gTaxliTig^ j.n maintaimnjr student^erformance ; " 

(Williams-,^ Pollack , ^5 Ferguson, 1975), its parameters have not beea""' 

investigated. In pjtrticulW, the effects of the leniency of this^gfade 

; ^ ; • '-^^ > ^ ^ — ^^T^ 

-felnrrbtiTTon remain to^'be explo-red. For example, does ^mo.re lib erial 

grading affect the quality of studeicit work and in which direction? 

The relationship between grades and course/instruetor evaluations -by 

students is. somewhat better understood* In general, most research has 

^hown that ^uch ev^ luations Vary positively with student grades 

(e.g., Dpyle^S Whitcly, 1974; Gessner> 1973; Kennedy, 19750. One 

difficulty with tliis research is that grades typically have been confounded 

with actual student performance* Whether it is the. receipt of the grade, 

or the student's ])articular level of work or performance, which is 

> •«* 

rdsponsible for the favorabJLe evaluatTon^ ^^LLJigL ^ ^^^^ 

way to address this question is through the use of separate groups Qf 
students,, whose perfoi^imiance does not differ, l)Ut who receive ^ i 
different grades. In contrast with a correlational approach (e^.g* , Konncdyy 
1975X, such an. experimental analysis would permit co;iclusix)ns relating 
the effects of grading on student evaluations. *" * 

• * Method ' • ' „ 

Subjects ' , ^ ^ 

Two sections of an undergraduate Developmental Psychology course' were 
*used, both meeting for 75 minutes on Tuesdays and Thursdays • One, section. 



meeting at l\ /i.'n. , had 134 students, while the other, meeting at 

2:00 p.m., had 116 students. ^'Botirs^eotionSj met in the same room, doVered 

^ • * - ' ^' ' 

the same material, at the Same rate, 'presented by the same two instructors 

' ^^'^ . ^ • 

^timl wore otheiviFc* treated as similarly as possible ^ * * . 

If ^ 

Procedure- . - . ^ 

TL^_s^emest^eii_\cas iiyldeAJ.ntQ.,f ouj. jw.Lt^^^ each 



.followed by a 40-'item multiple-chofce exam covering *that unit, with an, 
optional comprehensive final exam. Dr^ ^armiento taught the :^irst an^ • 



fourth units while Dr. Vasta was *xn charge'* of the second and'third units. 
Classes consisted primarily of lectures, supplemented by films and slides, 
for both instructors. The students were informed that their final, course 
gx*ad'e would be tlie niunerical average of the best four out of the five 



possible exams, with~^ade cutro^fs based dh the performance of the class. " 
The independent variable of the. experiment was the distribution of grades^ 
with the morning class, selected randomly by Dr* Sa^iento, receiving a 
distribution more difficult than used in previous seWsters of the course 
and the afternoon class receiving one more liberaJ. iDr, Vas\a was unaware 



of which group got whidi distribution while he was te:aching. For the^ 
first exam, the morning glass had 8% A,, 17% B, 38% C;\24% D an<l 13% E 



1 



grades,, while the Sftemoon^ class had.21% A, 27% B, 371% Ci and 10% D md 
5% E graces.. ^Similar distributions wfere -used for the second and third " 



exams,. Howeyer.. at the end of the course, final grades} were assigned 
accor ding to tlie liberal distributim iMiitcLth sect ions i -la^wpformance on- / 



the first exam represents a pretreatment comparison of the two groups 

— ^""'^ — ~ ^ — ] ^ ^^^^ " ~ 

since they had not -been assigned any grades yet. 

At each of tlie ft)ur*unit exams,' the students in* bothl sectieps filled 

put a short,' anonymous questionnaire concerning their studying, bel^aviprs 

and class attendance for that unit. During the fourth unit, they filled 



out a course e\alaation questionnaire and instructor evaluations for 

\_ • 
both j>rofcssorV^, nvzain aaonymously* 

^\ 

/llxo question^ of primary interest was UTiat effect would grade 
distribution have on exam performance^ studying behaviors and class 
attendance, and coursc/in'structoj - evaluations?'' * 
* Rosulfc-s- — — 



gj^am Performance 



^•^~"'^~1^^iT3"eores tor che: jheSnrng and atternoon sisctions oa the first 

^am v/ere 66.'7S% corrctt land. 66.6^% correct, respectively, with similar 
tlijtfibutions. As mentiSn^d' previously, this constitutes a pretrgatment 
comparison. Performance on the second, third and fourth exams was used* 
to examine the effects of the grade distribution manipulation. On each 



of these exams, there we»re only ^^light differences betweea the groups in 
means and distributions and* those were not stat:i^stically significant 
based on t-tests and Kolmogordv-Smimov tests CBp?coe> J969), More in 
depth analyses were ''also con.ducted. ' . , . 

The differential grading procedure resulted in some scores that 
would obtain the s.ame ^ letter grade in both sbotioRS.,. although, high ia thc. 
range of scores^ for t1;iat graSe in one section and low in the ottier. On the 



• assuitjption that a different letter gr^de would be more salient than 
differences witlun a grade category, these sV.ore$ were removed and only 
those students in each 5e-ction with scores that would giVe rxse to 



Hrfferent Tetter" grades on the first ex^un were compared on performance tar 

f>yam two. Again, n o fitntisti cally sign ificant differences in^means or 

distributions were obta-^hed* From this :group, students who on exam 2 
again^had received the same scores but .different letter grades were 
compared oh. exam 3 /with no significant differences occuring/ Of these, 
those 'with .scores on exam 3 giving rise to different letter grades were 



-c,^»par.oa on- ex..n J porfon^ance and again no signifidaixt differences were 
fdund. -i-hus, there is no evidence that tl4fe grading manipulation had any 
effe.ct oh exam porfonnance. 

Studying. Behaviors and Attendance " • . ^ 

Tlie i-esuitp of the -questionnaires ex^ining studying behavioxs and 

el-ass-ttirte;3d^nre--?hinre-d-Trd~s^^^ ~Th'e 

percentage of students from each, class returning the questionnaires . 

remained approximately the same for all, four unit examsi - ' 

Course/ Instructor Hva-liiations " . * 

The Instrument used to examine student evaluations of the course and 



instructors was the standard department ar-qu^stionnaire> not •designed'' 
specifically for use in this study., thus, some 'of the' items were not 



relevant to the current investigation and were not used- ifftthe 'analysis • 

The proi^rbions^of^stu^^ each section returning the questionnaire 

^""^^ ~— ^-t-^ — ~ ; ^ 
were not significantly different. 3 . . 

* * • . 

Two of the items on the questionnaire bear directly on the effective-. 

ness of the grade manipulation. One asked students >to predict their final 
-letter -g.r-ade-ih^ttie,. course and the other 'asked- them to rate the coarse 
grading s ystem from st rict to lejiieat . R esults -fo .r these items were 
consistent with -tlic differences /in gradie distributions between the sections 
with the former item significantly different between groups based on a 
t-test. . _ ^ ^ 



Of the remaining relevant items, sixteen were evalu.ations of the^ 
course n n d^slxtiieD-^a^±he^n,<^^,r »v" t ^> ^^ ^ for |he/other_instr»ytQt^/ 

There were also four demographic items assessing sex, grade point average, 

» .•»»*' 'I 

class and reason for taking the course. * ; ' ^ . - 

Six of the 48 evaluative items .showed significantly different mean 



reSDon50S bet:v<?^n r!ie two sections > with a = 0.05« On eacR of these 
items, Che group receiving thp more lenient distribution gave more 
favorable evaluations. Group differences- on three more items approached 
significance and only on one of these, -concerning the adequacy with which 
the exai^s covered the\pourse material, did^|||^^^gx^ 



^difficult distribution give a more favorable ratting. 

The two instructors were evaluated differently by both groups but 
a t-tcst .on difference ^cores between Llie Iwu groups for the 16 Ttoms — 



.concerning tl\e iustriictQrs showed no significant effect so these items 
were combined for/further .analysis* 'Considering only the direction of 
the ditrerences ^between groups, not the magnitude of these" drfBerences , 
24 of the 32 i/eins concerning iristruc1;ors showed a more favorable rating « 
by" the group ^ece'iriTig ^hp ientcnt^grade- distribution^^ 

♦ — ' rr- 

significam'ly fro^n chance expectations ^ = 8vOO, 1 d,f, ^ ^ < .QJl). Of 
-cthe, iteips concerning the course, 14 out of 16 were* rated more favorably 
by the group with the lenient distribution> again* a significant effect 
(^2 g.oo; 1 d.f.,- p < .01). ' , ^ ^ 

Discussion 

Thecfaiiure of the two grade distributions to differentially affect ^ 
student exam performance. or self-repor-ts of actual study behaviors and 
attendance ^is intrii'.uins. Since manipulation "checks" confirm that the. 
desired aspects of the treatment were implemented, the absence of differ- 
ential perfoiTnance> raises a number of questions. For exajnple, the, extent 
to which such distributioris interact with individ,l>al student characteristics 



or the extent to which these exams measure nonspeciific Academic ikills., are 
issues for future research. Perhaps, it is, most parsimonious to conclude 



simply that the\p,r;ule manipulation r although salient, was not effective 
in' altering studenrs! studying ..of exam behaviors,, which have l^een developed 
-aad-mHim^inecr wer a considerable netiod of time,. Tlie potential effects 
of su<Jh grading practices over a longer period of time or courses remain 

to be jnvestigated. " . , ^ : ^' 

— life "posit i\^eTffects bf higher grades on student evaluations are 
-7 >nnavnpni 10 p.i 1 1 W f fnH i fp p ., Kennedy, 1975) > Moreover, the y 
suggest that grades, per se, can produce this relationship > independent 

of th e students ' actual performance^ — \ ^ ^ 

^ Tnis finding might be used to support the current trend tpward 
liberal - grading, since such a practice does^not appear to adversely affect 
perforaance> y^t Increases evaluative responses, The generality--of^^^^he' - 
''current results, however, -remains to be established with a variety of 
students, cour^jes, and instructional methods, perhaps* a better inmiefdiate^ 
implication of the present findings is the use of caution in interpreting 
student evaluations. Since such data are 'often used to assess teacher or 
jnethpd eff.ectiyeiress , ^he identification of additional influencing variables 
question the wisdom of these practices. 
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